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ABSTRACT 

We present here the first results from two recently completed, fully sampled red- 
shift surveys comprising 3703 IRAS Faint Source Survey (FSS) galaxies. An unbiased 
counts-in-cells analysis finds a clustering strength in broad agreement with other re- 
cent redshift surveys and at odds with the standard cold dark matter model. We 
combine our data with those from the QDOT and 1.2 Jy surveys, producing a single 
estimate of the IRAS galaxy clustering strength. We compare the data with the power 
spectrum derived from a mixed dark matter universe. Direct comparison of the clus- 
tering strength seen in the IRAS samples with that seen in the APM-Stromlo survey 
suggests bo/bi — 1.20 ± 0.05 assuming a linear, scale independent biasing. We also 
perform a cell by cell comparison of our FSS-z sample with galaxies from the first 
CfA slice, testing the viability of a linear-biasing scheme linking the two. We are able 
to rule out models in which the FSS-z galaxies identically trace the CfA galaxies on 
scales 5-20/i~^Mpc. On scales of 5 and 10/i~^Mpc no linear-biasing model can be found 
relating the two samples. We argue that this result is expected since the CfA sample 
includes more elliptical galaxies which have different clustering properties from spi- 
rals. On scales of 20ft-~^Mpc no linear-biasing model with bo/bj < 1.70 is acceptable. 
When comparing the FSS-z galaxies to the CfA spirals, however, the two populations 
trace the same structures within our uncertainties. 

Key vifords: 

galaxies: clusters: general - galaxies: elliptical & lenticular, cD - galaxies: spiral - 
large-scale structure of Universe - infrared: galaxies 



1 INTRODUCTION 



Following the success of IRAS Point Source Catalog (PSC) 
galaxy redshift surveys in probing the large scale structure 
of the local Universe, which presented serious problems for 



the standard cold dark matter fCDM) theory ( 


Efstathiou ei 


a/.199C 


; Saunders et a/. 1991; Kaiser et al.19 


n 




VIoore ei 


al.l992 


; ^trauss et a/. 1992; |Fisher et a/. 1993) two new fainter 



redshift surveys have been undertaken to confirm these re- 
sults at significantly greater depths. 



The first of these surveys (FSS-z sQ) was based on sam- 
ples drawn from t he IR AS Faint Source Data Base [FSDB, 
see Moshir et al. ( |199^ )]. On the basis of deep IRAS cov- 
erage and freedom from cirrus contamination, three regions 
in the Northern Galactic Hemisphere were selected (these 
areas are defined in Table |l|). Within these areas we believe 
the FSDB to be ~ 99 per cent complete for ^eo > 0.2 Jy 
(Lonsdale et al. in preparation). We thus constructed a cat- 



* These surveys have been referred to previously as 'QCCOD' 
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Table 1. Definition of FSS-2 areas. Areas N, A and E constitute 
FSS-2 I while P and X constitute FSS-z II. 



Table 2. Redshift completeness statistics. The subsample has 
sources lying within our 'mask' excluded. Figures in parenthesis 





Definition 


Area/ 


indicate the number of galaxies for which we have redshifts. 








sq deg 




Total 


galaxies 


subsample 


560 > 0.25Jy 


N 


150 deg < / < 210 deg, 


50 deg < fe < 70 deg 


594 


















A 


30 deg < / < 90 deg, 


60 deg < 6 < 68 deg 


178 


N 


1410 


1369 


(1253) 


1369 


(1253) 


924 


(874) 


E 


70 deg < / < 90 deg. 


50 deg <b< 55.5 deg 


67 


A 


474 


457 


(413) 


383 


(361) 


263 


(251) 


P 


26.5 deg < (5 < 44.5 deg. 


fe > 70 deg if a < 12'^ 


476 


E 


107 


105 


(103) 


105 


(103) 


71 


(70) 






or 6 > 68 deg if a > 12'' 




P 


1416 


1391 


(1133) 


1234 


(1062) 


921 


(826) 


X 


32.5 deg < 5 < 38.5 deg. 


fe < 50 deg & a > 8'' 


94 


X 


321 


312 


(236) 


224 


(203) 


146 


(134) 












3728 


3634 


(3138) 


3315 


(2982) 


2325 


(2155) 



alogue of all FSDB sources within these areas having good 
to moderate 60/im fluxes greater than 0.2 Jy. (In area E 
an additional sample was selected with 0.15 < Seo < 0-2 
but this is not considered further in this paper.) The ma- 
jorit y of s uch sources are gal axies [e.g. Rowan-Robinson et 
al. ( |l98e| ), Lawrence et al. (|l986[)]. No formal colour-cuts 



were employed to exclude either stars or cirrus sources. The 
former were excluded upon inspection of the POSS plates or 
APM cartoons together with a case by case examination of 
the colours. In the combined FSS samples only five 'stars' 
lay close to the galaxy colour locus and they had the same 
colours as the 60/im excess stars. In addition 13 galaxies 
were excluded because a bright star lay in the field making 
identification and acquisition impossible, these are assumed 
to be random line of sight coincidences with little effect on 
our analysis. Cirrus sources were rare because of our care- 
ful selection of areas (four sources have been excluded upon 
examination of maps made from the raw IRAS data). This 
survey was particularly designed to clarify the evolutionary 
behaviour of the IRAS popu lation; see contrast ing c onclu- 
sions o f Saunders et a/.(|l990|) and Fisher et al. (|l992|). 



Th 3 second survey (FSS-z II ) was construct ed from the 



Faint Source Catalog Version 2 ( Moshir et a/. 1992 ). The area 
selected traversed the North Galactic Pole connecting the 
FSS-z I areas. In other respects the FSS-z II samples were 
selected in the same manner as FSS-2 I. Designed principally 
for large scale structure studies, the fiux completeness in 
areas P and (to a lesser extent) X is not as good as other 
areas and is estimated to be 90 per cent at 0.25 Jy. 

The combined surveys cover an area of 1310 deg^ (0.4 
sr) and contain 3728 sources, more than 3600 of which are 
galaxies. 

Redshifts were obtained from the literature for 872 
sources; a number of redshifts were kindly provided in ad- 
vance of publication by: John Huchra, Ray Wolstencroft, 
Quentin Parker & Roger Clowes and Marc Davis & Michael 
Strauss. Major observation programmes were instigated to 
obtain the redshifts for the remaining sources using the FOS 
and FOS2 instruments on the INT and WHT facilities. Us- 
ing automatic, optimal extraction techniques and line fit- 
ting procedures we obtained an average redshift accuracy 
of 



190km s'^ (Oliver 1993 



and Oliver et al.in prepara- 
250km s~^ for the QDOT survey 



tion), c.f. an accuracy of 
(Lawrence et al, in preparation). 

Of all 1931 FSS-z I galaxies we currently have 1769 
redshifts giving an overall redshift completeness of 91.6 per 
cent; while for the FSS-z II project we have redshifts for 80.4 
per cent of the galaxies. A mask has been constructed which, 
as well as defining the survey boundaries, excludes a number 
of sectors that have not been exhaustively followed up and 
those sources lying close to the IRAS coverage gap. Upon ap- 



plication of this mask the completeness statistics improve to 
92.5 per cent and 86.8 per cent respectively. Above 0.25Jy 
(where FSS-z II suffers less from flux incompleteness) we 
have 95 per cent of the FSS-z I redshifts and 90 per cent 
for FSS-z II. Detailed numbers are listed in Table |^ Of the 
failures the most interesting will be those with either very 
faint or no optical counterparts. VLA maps have been ob- 
tained for all sources in the FSS-z I that are either blank 
within the IRAS error ellipse (to the limit of the sky sur- 
vey plates) or have very faint identifications. i?-band CCD 
images have also been obtained for many of these sources. 
These have yielded a number of further identifications and 
redshifts have been obtained for several of these. Work is 
continuing to obtain the redshifts for the remainder. 

This paper quantifies the large scale clustering seen in 
both surveys and directly compares these samples with op- 
tical surveys. These surveys are also being used for many 
other studies. It was during the first of these proje cts that 



the unique object F10214-I-4724 was discovered (Rowan- 



Robinson et a/. 1991). The completeness and reliability of the 
FSS-z I survey will be detailed in future papers (Lonsdale 
et al.&i McMahon et al, in preparation). Simple tests revea l 
significant evolution within this sample deliver et a/.1995| ). 
This evolution will be elaborated in greater detail by Broad- 
hurst et a/. (in preparation). The appearance of the Bootes 
void which overlaps with areas A & E will be discussed by 
Oliver et a/. (in preparation). The data for both surveys will 
be presented by Oliver et al. (in preparation). 



2 COUNTS-IN-CELLS 

To assess the large scale structure seen in the FSS-z surveys 
quantitatively we have chosen to ap ply th e counts-in-cells 
method described by Efstathiou et al. ( [l990t ; hereafter E90) 
and also used by Loveday et al. (1992). The two-point auto- 
correlation function is a useful descriptive statistic; indeed 
for Gaussian fields it provides a complete statistical profile 
of the field. The counts-in-cells method measures a volume 
integral of this function, the variance, and avoids the prob- 
lem of assigning weight to galaxies in direct measurements of 
the two-point correlation function This analysis is also rela- 
tively straightforward to perform in the presence of a mask 
covering a significant fraction of the sky, a considerable ad- 
vantage over direct determinations of the power-spectrum 
in Fourier space. 
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2.1 Basic Method 

We consider tlie moments of galaxy counts (Ni) over a lattice 
of M cells. Following E90 we define the statistics 



N 



i 



TV. 



(1) 



(2) 



The statistic iV has an expectation value nV, where n is 
the mean galaxy density and V is the cell volume. // the 
cells are independent then the expectation value of 5", (S) — 
n^a^V^, where cr^ is a v 
autocorrelation function: 



ari2)dVidV2. 



(3) 



a is also closely related to the power spectrum as described 
in Section 2.4. 



If the cells are independent and the number of cells is 
large then we can derive an expression for the variance of 
S [var(S')] in terms of the first four moments of the density 
field (equation 4 in E90). However, if we assume Gaussian 
fluctuations then the higher order reduced correlation func- 
tions vanish giving 



var(S) = 



M 



(4) 



This allows us to estimate var(S') for a given sample by using 
the empirical estimates nV « iV and ~ S/N^ in equa- 
tion ^ For non Gaussian fluctuations this underestimates 
the errors and ideally we should use N-body simulations of 
a particular model to determine higher order moments and 
thus estimate the errors. 

A survey volume is divided up into concentric shells 
and these are then divided into Mi cells. The mean number 
density of the cells is constant within a given shell allowing 
density estimations of the variance from each shell. These 
estimates can be combined using the maximum likelihood 
technique. The statistics A'i and Si are calculated for each 
shell and we construct the likelihood function from all shells 



n 



[27rvar(5'i)li/2 



exp 



2var(5'i) 



(5) 



We can now estimate by replacing riiV^ with and 
maximizing the likelihood function numerically. We can jus- 
tify not using a joint likelihood function for the two estima- 
tors, Ni and Si, since the errors in iV^ are much smaller. 
The 68 per cent confldence limits to this maximum likeli- 
hood solution are the values of where the log likelihood 
has dropped by a factor of 0.5 from its maximum value. 
This maximum likelihood technique automatically weights 
the shells, shells with few cells or sparse density receiving 
less weight. 

To determine how well the data are modelled by our 
msiximum likelihood solution we also calculate the of each 
fit. 



^ 2^ var(S'.) 



(6) 



where the number of degrees of freedom (i/) is one less than 
the number of shells. 



2.2 Detailed Method 

In this Section we discuss finer points of the counts in 
cells method. Specifically we propose two refinements to the 
method of E90. 

In practice each cell does not have exactly the same 
volume owing to the non-uniform mask. We correct the es- 
timators TV and 5* for incomplete volumes as prescribed by 
equation 9 in E90 (N.B. equation 9 in E90 has a minor ty- 
pographical error) i.e. 



N 





- TVV) 


2 

- 


1 - 




EN 


TV2 











(7) 



(8) 



This correction only takes into account the fact that the 
expected number of galaxies in a restricted cell will be re- 
duced. In addition, a masked cell will, in general, sample 
smaller scales. Since it is observed that clustering is stronger 
on smaller scales this means that the expected increases. 
This bias is difficult to correct for since it depends in detail 
on the topology of the mask and the slope of the correlation 
function. We have elected simply to exclude those cells with 
more than half the volume masked. 

At large distances the number of galaxies per cell has 
dropped significantly so that we no longer get a meaningful 
addition to our estimate of the variance. E90 showed that if 
TV(T^ ^ 1 the variance is inversely proportional to the vol- 
ume. Similarly it can be shown that if TVcr^ <^ 1 and TV ^ 1 
the variance on actually increases with additional cells. 
Thus TV(T^ = 1 represents some measure of useful depth of a 
survey (not surprisingly this is where the Poisson variance 
equals the clustering variance) . Although the maximum like- 
lihood analysis includes a weighting scheme there is little 
point in taking shells much beyond this limit. We estimate 
this depth using fro m E90 and the luminosity function 
of Saunders et al. ( 1990| ) and only consider shells to roughly 
twice this depth. 

It has been claimed that the counts- in-cells method as it 
stands is independent of the galaxy selection function [E90, 
Efstathiou (1995)]. This is not strictly true. Since the selec- 
tion function is not constant across each shell the galaxies 
will tend to be concentrated towards the nearest side. This 
reduces the effective volume of the cells and (as with mask- 
ing) will bias the variance upwards. With prior knowledge of 
the clustering strength and selection function one could esti- 
mate and remove this bias. Instead of this we eliminate this 
bias by volume limiting each shell (i.e. removing all galaxies 
that would not have been visible if placed at the far side of 
their shell). The removal of galaxies should not seriously af- 
fect our final errors since they are not sensitive to the mean 
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density. As expected this correction reduces the variance in 
almost every case. For QDOT in particular the reduction 
was considerable, up to 1.7 times the quoted errors. Com- 
fortingly the usually decreased, showing that the fit was 
better. 

We found that the variances obtained by the counts-in- 
cells method were quite sensitive to the specific grid pattern 
used. To reduce this sensitivity we use 16 different grids for 
each survey. First we divide the volumes into equal thick- 
ness concentric shells, then divide these shells with latitude 
and longitude cuts such that the resulting cells have equal 
volumes (in the absence of any mask). These latitude and 
longitude cuts are then offset by 1 /2 and 1 /4 spacing in both 
dimensions to produce eight different grids. This entire pro- 
cess is then repeated with the shells offset by 1/2 a shell 
thickness. Each grid gives us la upper and lower limits to 
the variance. We take the linear average of these over the 
16 grids to produce a final estimate of the upper and lower 
limits on the variance. In the process we also produce an es- 
timate of the scatter between the individual grids; in many 
cases it is comparable to our quoted errors. By averaging 
over 16 different grids we have thus undoubtedly reduced 
the errors. It would be possible to account for the reduction 
in systematic errors by subtracting the grid error (estimated 
from the scatter between different grids) in quadrature from 
the maximum likelihood error estimate. However, in some 
cases our grid error is actually larger than our maximum 
likelihood error, and this procedure would lead to negative 
variances. This is a further indication that the maximum 
likelihood errors are underestimated. So a correction for the 
reduction in systematic errors due to the re-gridding is not 
possible without using full non-Gaussian error estimates. 
Figure ^ shows for each of the 16 grids used for our FSS-2: 
surveys together with the final averaged results. 

To assess the goodness of the maximum likelihood so- 
lutions we also sum up the and u over all grids. Since 
variances determined from the shells of one grid are not in- 
dependent from the shells in another grid we have overesti- 
mated the number of degrees of freedom, hence the overall 
goodness of fit will be optimistic. 

In equation^ we estimate n^V^ with the biased estima- 
tor Nf . We have investigated replacing Nf with the unbiased 
estimator [{J^i N)'^ - J2i N^]/M{M - 1) but did not find 
any significant differences. 

A further bias arises because the cells are not indepen- 
dent. As discussed by E90 and Loveday et al. (1992) this af- 
fects the estimate of var(S). However, correlations between 
cells also bias the estimator S downwards. Ideally each shell 
should include many uncorrelated volumes. A small area sur- 
vey will have fewer uncorrelated cells per shell than a large 
area survey of the same volume. We thus might expect our 
FSS-z to show less variance than the all sky surveys and 
since we do not see such an effect we assume that this bias 
is small. 

2.3 Results 

We apply the above method to our new surveys and also 
to existing IRAS redshift surveys: the latest version of 
the QDOT s urvey (Lawrence et al. in preparation); the 
1.2 Jy survey ( [Fisher et aL1995| ); a faint QDOT sample (0.6- 
1.2 Jy); a combined PSC (QDOT and 1.2 Jy) sample. We 



Figure 1. as a function of cell dimensions (l) for each of the 
16 grids of the FSS-2 surveys (error bars not plotted). Also shown 
is the average of these 16 with averaged error bars 



also combine our FSS-z sample with the combined PSC sam- 
ple to produce a single IRAS sample. 

For our FSS-z surveys we flux limit the whole sample 
at 0.25Jy to allow for the varying completeness limits in 
different areas. The mask was described in Section ^ 

For the QDOT and faint QDOT samples we use the 
standard QDOT mask. The faint QDOT sample has 0.6 < 
Seo < 1.2 Jy so the volume limiting is rather harsh. 

In the analysis of the 1.2 Jy sample we apply a cut of 
|6| > 5deg together with the QDOT mask. This is slightly 
more severe than the 1.2 Jy team used themselves. 

Since the faint QDOT sample has no galaxies in com- 
mon with the 1.2 Jy survey the two can be combined to 
construct a larger PSC sample. For this PSC sample we use 
the same mask as we used for the 1.2 Jy catalogue. Many 
fewer galaxies are excluded when volume limiting the com- 
bined PSC sample than are excluded when volume limiting 
the samples separately. 

Finally, we take the individual variance estimates from 
each shell of our combined PSC sample, together with the 
individual variances from each shell of our FSS-a sample, to 
construct a single likelihood function (equation ^) , as before. 
Maximizing this likelihood function gives us a single variance 
estimate from all samples. Since the volume of overlap be- 
tween the FSS-z and the PSC is small, any interdependence 
will not be a significant problem. 

The results for all the various samples described above 
are given in Table ^. 

In many cases the a-nd f values indicate that the 
goodness of fit for the individual grids is poor. As discussed 
in E90 the assumption of Gaussian fluctuations causes the 
errors to be underestimated which may explain this. Since 
we reduce the systematic errors by averaging over 16 grids, 
without adjusting our error estimates accordingly we have 
to some extent compensated for this. 

The QDOT variances appe ar slig htly smaller than those 
quoted by E90 and Efstathiou ( 1995| ). The main reason for 
this is that by volume limiting each shell we have removed 
the bias due to gradients in the selection function. The grids 
also play a part in this apparent discrepancy; without vol- 
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Table 3. ct^ for the various samples, averaged over 16 grids in 
each case. Including a-nd grid scatter estimate 
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ume limiting, one grid did produce a variance as high as the 
E90 variance at 40/i~^ Mpc. Notice that the scatter between 
grids is always comparable to our quoted error (i.e. roughly 
the maximum likelihood error from a single grid). The ex- 
planation fo r the high E90 variance at 40/i~^ Mpc given by 
Efstathiou (1995) is that there was an upward statistical 



fluctuation in the number of QDOT galaxies in Hercules. 
We believe that this fluctuation had a bigger impact on the 
E90 analysis than on ours partially because we used mul- 
tiple grids and partially because we removed the selection 
function bias. 

The 1.2 Jy results, on the other h and, appear slightly 
larger than the results of Fisher et aZ.(1994). This may be 
due to difi'erences in the masks. 



In comparing estimates of clustering from different sur- 
veys we can assume that the ratio of variances is independent 
of scale and use a weighted average of the log of this ratio 
(with associated and v) to estimate ~ ^^o"! /cr^ . We 

flnd 6qdot/&i.2 Jy = 0.98 ± 0.06, &fss/6i.2 jy = 1.03 ± 0.06 
and fepss/fegDOT = 1.06±0.07. All have acceptable values 
(indicating that the scale independence is justifiable) and on 
each scale the variances between surveys are consistent. All 
are consistent with fca/fct = 1 and in addition there is no 
significant disagreement between the variances measured by 
the different surveys on any scale. This can be seen in Fig- 
ure ^ This in turn justifies our combining the samples. (In 
these comparisons we have assumed that the errors from 
each survey are independent. The many galaxies in com- 
mon between QDOT and 1.2 Jy suggest that they are not 
in fact independent so we have overestimated the errors in 
feQDOT/6i.2 Jy However, the true errors would need to be 
much smaller for the the surveys to disagree significantly.) 

The agreement between QDOT and 1.2 Jy is slig htly a t 
odds with the conclusions of Tadros & Efstathiou's (1991:) 
power spectrum analysis of the two surveys. They find a 
discrepancy between the two surveys which disappears after 
the exclusion of the Hercules region. Their analysis uses a 
select ion function derived from the 60/im luminosity func- 
tion (Saunders et aZ. 199C). It is possible that their analysis 
is more sensitive to the Hercules anomaly as a result of er- 
rors in this selection function. At the distance of Hercules, 



errors in the evolutionary term of the Saunders et a/. (1990) 
luminosity function contribute a systematic 7.5 per cent un- 
certainty to their selection function. Our analysis is indepen- 
dent of the selection function. Differences between the mask 
used by Tadros & Efstathiou and the one we use could also 
have affected this comparison, by excluding significant fea- 
tures in one or other analysis. 

A more detailed comparison of the 1.2 Jy survey and 
QDOT survey shows th at they are consis tent with a single 
underlying density field (Efstathiou 1995). 



We can apply the same test to compare the variances 
of these surveys with the standard CDM variances quoted 
by E90. For none of the surveys does the weighted ratio of 
galaxy to CDM variances provide an acceptable fit at around 
99 per cent significance. Using the full IRAS sample we find 
a weighted average 6iras/6cdm = 0.95 ± 0.02 (this average 
is dominated by the smallest scales). The fit is bad, ~ 
17.0, = 4, allowing us to reject the CDM model at the > 99 
per cent level. This is clearly shown in Figure ^ where the 
slope of the variances with scale from the combined IRAS 
data is clearly at odds with the CDM slope. 



2.4 Comparison With Model Power Spectra 

Theoretical clustering predictions in the linear regime are 
frequently expressed in terms of the power spectrum P(fc) 
or in dimensionless form oc fc^P(fc). The quantity is 
defined as the variance per Infc i.e. A^ = da^ /d\nk. For a 
power law spectrum (A^ oc k"^"^) the fiuctuations within 
Gaussian spheres (radius Rq) is given by 



k = 



A^(fc) 
2^ 2 '\ 



l/(n + 3) ^ 



(9) 
(10) 
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Figure g shows the predicted Fourier amphtudes from 
the FSS-z sample, QDOT, the 1.2 Jy sample and the PSC- 
FSS sample, calculated from equations ^ and ^ and using 
the results from Table ^ We also show the expected results 
from the APM power spectrum inversion of Baugh & Efs- 
tathiou (1994). We have assumed that the linear IRAS bias 
is bi = 0.7 and the optical APM galaxies have a bias of 
bo — 0.9, consistent with the results of Section 3, below. 

These results are compared with the Mixed Dark Mat- 
ter (MDM) model, which has previously been show n to be in 
good agreement with l arge scale clustering data (Taylor & 



Figure 2. Counts in cells variance as a function of scale: Top: 
1.2 Jy Survey - solid line and stars; QDOT Survey - dashed line 
and circles; FSS-2 - dot/dash line and crosses. Bottom: Standard 
Cold Dark Matter (E90) - sold line and stars; all IRAS surveys 
together - dashed line and circles. 



Rowan-Robinson 1992| , Schaefer & Shafi 1992). The model 
shown has Qhdm = 0.15 and h — 0.5. taken from the re- 
sults of van Dalen & Shaefer ( 1992 ), and is normalized to 
the COBE spectrum assuming n — 1, yielding an ef fectiv e 
quadrupole of 19.9 ± 1.6/iK derived by Gorski et a/. ( 1994 ). 
We have also plotted the scale invariant n = 1 Harrison- 
Zeldovich spectrum. 

Figure |^ again shows that there is now broad agree- 
ment between QDOT, 1.2 Jy and the PSC-FSS surveys. The 
MDM model is a good fit to the APM data, although it ap- 
pears to have a slightly steeper slope than the IRAS data. 
This fit could not be improved by increasing the proportion 
of hot dark matter (HDM) because, with the same COBE 
normalization, we would then predict too much power on 
all these scales. Improvements could be made, either by ad- 
justing the primordial spectral index, or by introducing a 
scale dependent bias. There are a number of reasons why 
the small discrepancies between the MDM model and the 
IRAS data do not justify such fine tuning. First, there are a 
number of debatable assumptions that have been made in or- 
der to predict the non-linear, redshift-space power spectrum 
from a linear theory, real-space power spectrum. Secondly, 
the assumption of Gaussian fluctuations in the data analy- 
sis causes us to underestimate our errors as described above. 
Thus we would conclude that our data are not incompatible 
with the MDM model. 



For cubical cells (V = l^) the sam e expression applies but 
with Rg l/\/T2 (Peacock 1991). The theoretical predic- 
tions relate to the real-space mass fluctuations whereas the 
we have calculated is for galaxies in redshift-space. A 
common assumption is that the real-space galaxy and mass 
variances are linearly related by a bias parameter b (see e.g. 
equation |l^, below) i.e. cr^ = Ug/b^, while the redshift- 
space/real-space distortions on large scales can be corrected 
for using the relation 



2 _ 2 



. 2 /n' 
^+3 - 



0.6 



b 



(11) 



( [Kaiser 19871 ). 

We also make a functional correction to the model 
power spectrum to provide an approximation to the non- 



line ar e volution of the power spectrum ( Peacock fc Dodd; 
1994|)7] n addition we use the Peacock & Dodds ( [199^ cor- 



rection for the redshift-space distortions which accounts for 
small scale 'finger of god' effects (for this we assumed a pair- 
wise velocity dispersion of 450 kms~^ added in quadrature 
with the redshift measurement error of 190 kms~^). 



3 COMPARISON WITH OPTICAL 

An interesting question in the study of large scale structure 
is how the clustering properties of different galaxy popula- 
tions are related. Since the various structure formation theo- 
ries such as CDM predict the mass distribution, whereas red- 
shift surveys probe only the galaxy distribution, we need to 
know how these are related. If different classes of galaxies do 
not trace the same structures then clearly at best only some 
of these classes can be faithfully tracing the mass. With an 
increased understanding of how different classes are related 
we can hope to gain some understanding of how the galaxies 
relate to the mass. This question is particularly pertinent to 
the interpretation of flux limited redshift surveys where the 
mix of populations changes with both the flux limit and the 
redshift. Any survey that contains two (or more) popula- 
tions with different clustering properties is inevitably going 
to produce ambiguous results dependent on selection func- 
tions of the classes involved and the scales being considered. 

A very simple possible relationship between galaxy pop- 
ulations is that they linearly trace some underlying density 
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gest bo /hi = 1.4±0.1, ho /hi = 1.0±0.3 or ho /hi = 2.0±0.6 
depending on which correlat ion functions are compared ( La- 



Figure 3. Square root of the Power spectrum, A(fc), for the FSS- 
2 (stars), QDOT (circles). 1.2 Jy (triangles) PSC-FSS (squares) 
and APM (open circles). The data points have been offset along 
the k axis for clarity. The curved solid line is the Mixed Dark 
Matter model with 15 per cent hot dark matter and h = 0.5, nor- 
malized to COBE, and corrected for nonlinear effects. The broken 
line is the linear spectrum. The straight solid line is the extrapo- 
lated scale invariant Harrison-Zeldovich spectrum. The data has 
been corrected for linear redshift space distortion in the small an- 
gle regime, and a linear bias of 6j = 0.7 for the IRAS galaxies 
and bo = 0.9 for the optical APM galaxies. 



field subject to finite sample errors, i.e. 



1 5pa 

ha pa 



(r) 



1 Spb 
hb Pb 



(r) 



1 Spc 

he pc 



(r), etc. 



(12) 



(If the galaxy distributions were identical then obviously 
ha = hb etc..) A more general relationship could be expressed 



^(r) 

pa 



pb 



(13) 



in which case the linear-biasing model would be the first 
order approximation in the case of small Under the 
assumption of linear-biasing one can use the ratio of clus- 
tering amplitudes and similar statistics for different galaxy 
classes to estimate the relative bias parameters [e.g. a'^/o'l = 

[ha/hbf]. 

Comparisons of optical and infrared galaxy clustering 
have been mad e, but give confiicting answers. Babul & 
Postman (199C) compare IRAS galaxies and CfA galax- 



ies in the first CfA slice and conclude that they are con- 
sistent with same underlying galaxy distribution outside 
the core of the Coma cluster (i.e. ho /hi — 1). The ratio 
of variances between the Q DOT and APM-Stro mlo sur- 
veys, uo/'^i ~ 1-0 ± 0.35 (Loveday et a/.1992) implies 
bo /hi = 1.0 ± 0.18 (95 per cent confid ence), while power 
spectra analysis suggest bo /hi = 1.3 (Peacock & Dodds 



1994|)7J)ynamical estimates of /3 = OP'^ /b from IRAS sam- 
ples lie between fii — 0.6 and /3j = 1.3 with excursions 
to /3i = 0.25; optical estimates lie between /3o ~ 0. 4 and 
/3o = 0. 75 with earlier estimates as low as /3o = 0.1 (Dekel 
1994 |). '| rhe ratio Pi / Po = ho /hi but the wide range of esti- 
mates allows 0.8 < ho /hi < 3.3. Ratios of cross- and auto- 
angular-correlations between the UGC ESO and IRAS sug- 



hav, Nemiroff fc Piranl99(]| ). On small scales the 2 Jy/CfA 
cross-correlation amplitudes suggested ho /hi = 2, while 
ho /hi — 1 was found when compari ng the 2Jy surve y 



with the Sou thern Sky Redshift Survey (Strauss et a/. 1992) 
Fisher et a/.(1994) found ho/bi = 1.38±0.12 when compar- 



ing (T8 from the 1.2 Jy survey with the optical ^(r) of Davis 
& Peebles (psl. 



We can use our estimates of the variance in cells to- 
gether with that from an optical survey to estimate bo /hi. 
Using our PSC + FSS-z sample with the APM-Stromlo 
variances in cubic cells gives bo /hi — 1.20 ± 0.05 with 
~ 1.2, ly — A. This is consistent with some of the lower 
determinations above but rules out ho /hi = 1. 

There are two problems with these analyses. First they 
do not test the underlying hypothesis that the distribu- 
tions are linearly biased with respect to each other. Secondly 
the IRAS and optical classes are not clearly defined galaxy 
classes. What constitutes an IRAS galaxy depends on both 
the flux limit used and the redshift at which you find it, and 
the same is true of optical galaxies. Most notably a given 
galaxy can be seen in both optical and IRAS surveys. 

In this section we are going to explore the relationship 
between the clustering of IRAS and optical galaxies taken 
from our FSS-z surveys and the CfA samples. By design, 
the FSS-z surveys overlap with the first three strips of the 
CfA 2 survey. We are thus in a position to compare den- 
sity fields directly point-by-point, rather than by comparing 
global clustering statistics. We will define IRAS and optical 
galaxies such that the classes are unique (no optical galaxy 
can also be an IRAS galaxy) and on the basis of a physical 
distinction that (to first order) is independent of redshift. 



3.1 The CfA and FSS-z Samples 

The optical parent catalog ue is the only published strip 
in the second CfA survey (de Lapparent et a/. 1986); i.e. 



26.5 deg < 5 < 32.5 deg, 8'" < a < 16". The IRAS par- 
ent catalogue comprises the FSS-z samples discussed above, 
including all galaxies down to 0.2 Jy. 

Since our aim is to compare the structures traced by dif- 
ferent galaxies within the same volumes we must first restrict 
the two parent surveys to have the same areal profile. This 
means excluding CfA galaxies which lie within our FSS-z 
mask and also any FSS-z galaxies outside the CfA bound- 
aries. These cuts leave us with 976 FSS-z galaxies and 854 
CfA galaxies. 

Of course many galaxies that appear in the CfA cata- 
logue will also be found in the FSS-z. We identify all such 
pairs using a 3' search radius. If more than one partner is 
found for a given source the match is made on the basis 
of brightness, nearness etc. (the details of the duplicate re- 
movals are relatively unimportant since most duplicates are 
neighbouring galaxies) . One of these pairs was found to have 
a much larger redshift in one catalogue and, since the an- 
gular separation was also large, this match was rejected as 
a chance projection. This matching left us with three cat- 
alogues: 554 galaxies seen only in the CfA 2 survey; 300 
galaxies seen in both surveys; 676 galaxies seen only in the 
FSS-jz surveys. These three catalogues are statistically inde- 
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pendent, in the sense that no galaxy appears in more than 
one sample. 

We now require a more physically meaningful classifi- 
cation of 'IRAS' and 'optical' galaxies. The mix of galaxy 
types in our three samples depends to a large extent on the 
survey selection functions, for example a nearby elliptical 
may be seen in both surveys but a similar galaxy would be 
unlikely to be detected in the FSS-z sample at a greater dis- 
tance. To obviate this problem we have introduced a 60/im, 
B magnitude colour cut which neatly divides the common 
sample into two. The colour chosen is simply the ratio of the 
respective survey limits i.e. 

^ ^ lQ0.4(15.5-m_B) Q.-, 

0.2 ' ^ ' 

Naturally all objects seen only in the FSS-2; survey will 
have a colour that is more infrared than this limit and vice 
versa. The only dependence on redshift that is now present 
in our classification enters through the difference between 
the slopes of the spectral energy distributions around the 
60/im and B-bands. Applying reasonable K-corrections at 
z = 0.3 this flux criterion corresponds to a cut at is > 2L60 
and although this cut is essentially arbitrary, it provides a 
much more appealing definition of an IRAS galaxy or op- 
tical galaxy than its presence or absence in a flux limited 
catalogue. 

After applying this cut we arrive at two samples: 

866(735) IRAS galaxies, 
664(639) optical galaxies, 

where the numbers in parentheses indicate those sources 
that have redshifts. 

We already know that elliptical ga laxies do not trace 
the same density held as other galaxies (Dressier 1980) and 



we also know that ellipticals are under-represented in IRAS 
sample ( [de Jong et aL1984 ). So a priori we would expect 
some difference between the clustering of these two surveys 
simply from this morphological effect. This will be partic- 
ularly important in this comparison since the CfA strip is 
strongly dominated by the Coma cluster. We therefore cre- 
ate a further sample in which we restrict the optical galaxies 
to exclude all elliptical galaxies. To identify the ellipticals 
we use the morphological classiflcations in the CfA cata- 
logue and exclude all types earlier than Sa. This gives us an 
optical-spiral sample with 269(267) galaxies. 



the optical at this depth, nevertheless with the IRAS galax- 
ies to guide the eye one can pick it out. Beyond 10000 kms~^ 
between 12'' and 13'' there appear to be some voids of low 
signal-to-noise ratio in the optical data, but these are much 
less pronounced or non-existent in the IRAS data. In the 
optical-spiral sample we see that the Coma cluster is far less 
signiflcant (as we would expect). More detailed observations 
are strongly limited by the sparsity of the galaxies. 

Compressing the cone diagrams in the tangential di- 
mension increases the signal-to-noise ratio at larger radii 
(Figure ^). Here we can more clearly see the second wall in 
the IRAS sample and guided by that we see a second peak 
in the optical data. Notice that the peak corresponding to 
the 'Great Wall' is a single broad peak in the optical but has 
two peaks in the IRAS data. This arise from the difference 
in selection functions. 

The cone diagrams above are dominated by the se- 
lection functions of the various samples. Dividing the vol- 
ume into cells within concentric shells we can estimate 
p/p = ni/^rii for each class within each cell. In Figures]^, ^ 
we have plotted the densities of each cell as seen in the opti- 
cal sample against the densities as seen in the IRAS sample. 
Since there is no overlap between cells all the data points 
are independent. We have not plotted error bars which would 
have confused the plot. It should be noted that the under- 
densities will usually have fewer galaxies and therefore larger 
errors. The general trend in these plots seems to be a steeper 
slope than would be expected for identical clustering, or even 
a linear-bias model with ho /hi = 1.3. On larger scales this 
appears to be true both above and below the mean density 
indicating that, in the optical, not only are the clusters more 
dense but also the voids are less dense. On scales of 5/i~^ 
Mpc the IRAS density appears to be higher than the opti- 
cal density for densities larger than the mean density. This 
might be explained by flngers of god which would tend to 
smear out the optical clusters. 



3.3 Contingency Table 

We now turn to a quantitative comparison between t he tw o 
samples. This method was flrst discussed by Oliver (1993). 
Recently a similar but statist ically more rigo rous method 
was brought to our attention (Efstathiou 1995). 



3.2 Qualitative comparison 



3.3.1 Method 



The Figure g shows the cone diagrams for the three subsam- 
ples described in Section 3.1. In the optical sample the most 



striking structure is the Coma cluster and 'Great Wall'. In 
the IRAS sample the Coma cluster is considerably less pro- 
nounced, although the 'wall' itself appears equally promi- 
nent. The difference in appearance of the Coma cluster be- 
tween the two samples is easily explained; the optical sample 
contains more ellipticals which are preferentially found in 
rich clusters. Clearly the IRAS sample probes much deeper 
than the optical sample although they have similar numbers 
of galaxies (this is because of the breadth of the IRAS lumi- 
nosity function) i.e. the IRAS galaxies are sparser. There is 
some indication of a second 'wall' at around 15000km s~^ in 
the IRAS sample. Although this feature is poorly sampled in 



The most obvious scheme to test is that the IRAS and opti- 
cal galaxies cluster in exactly the same way (i.e. ho /hi — 1). 
Our null hypothesis is thus that the over-density in any given 
cell is independen of galaxy class. We divide our survey vol- 
ume into shells and cells as we did for the counts in cells 
analysis. Under our null hypothesis we are at liberty to com- 
bine the samples to give a single estimate of the 'galaxy' 
over-density in any cell. From this estimated over-density 
and the observed over-de nsities in each individua l class we 
construct a statistic, X'^ , ( Kendall fc Stuart 1979| ) which we 
compute across the whole shell; 



E{nij — nii,nt,j/n) 
nii,ni,j/n 



(15) 
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Figure 4. Cone Diagram of FSS-2;/CfA2 Strip 1 intersection. Top 
— IRAS Galaxies. Middle - optical galaxies. Bottom — optical- 
spiral galaxies. 



Here 11 (= riij) represents the total number of galaxies 
from both surveys in a given shell. The subscript i refers to 
the individual cells in the shell and j to the galaxy class. 
Wo have employed the notation rii* = Uij to represent 
the marginal distributions. Here we have implicitly assumed 
that the observed galaxies give us an estimate of the un- 
derlying galaxy-density field, and that the measured difFer- 



Figure 5. N{z) diagrams. The top diagram shows the N(z) = 
p(z)dV for the IRAS sample while the middle diagram shows the 
same for the optical Sample, the bottom diagram is the ratio 
between the two (together with a power-law fit) this gives us 
pj / po as a function of z, independent of clustering (if there is no 
relative biasing) 
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Figure 6. po/pb Pl/pi diagram of FSS-z/CfA2 Strip 1 inter- 
section. Densities are calculated in nearly cubical cells of sizes 5 
(top) and 10 Mpc (bottom). The straight line would indicate 
identical clustering, the curved line would be followed in a linear- 
biasing model with bo/bi = 1.3. All points are independent so 
the scatter is real, to avoid confusion we have not plotted error 
bars. 

ence in densities between the two populations is governed by 
Poisson statistics, i.e. that the additional error terms due to 
clustering (and cross-correlations between cells) cancel out. 
In this case is asymptotically distributed as and, when 
summed over all shells, has M — N degrees of freedom where 
M is the total number of cells and A'' is the number of shells. 



Figure 7. As for Figure |6| but for scales 20 (top) and 30 h ^Mpc 
(bottom). 

3.3.2 Selection Function 

As with the counts-in-cells analysis this analysis is affected 
by the selection functions of the samples. If the relative den- 
sity of the two samples pi/pb changes significantly across 
a shell then this will introduce spurious discrepancies be- 
tween the optical and IRAS clustering. As before, we could 
volume limit each sample within each shell. This discards 
more galaxies than necessary and for this analysis our statis- 
tics are sensitive to the mean density. To avoid any bias we 
require pi/pb to be constant over each shell. We can es- 
timate this quantity neatly, without calculating either se- 
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lection function independently, and (in the case of no rel- 
ative bias) independently of clustering. It is simply the ra- 
tio of the two p{z) distributions as shown in Figure ^. We 
have fitted the observed ratio with a power law giving us 
pi/pb{z) oc (1 + z)^'' . In any given shell we then randomly 
throw away galaxies from the most dense sample to give us 
a constant pi / po over that shell. This is very efficient since 
the errors are dominated by the least dense sample and in 
any case the gradients across a shell are small. Our results 
were not significantly affected by this correction. 



3.3.3 Cells with few galaxies 

Technically we can calculate for all cells where there 
is at least one galaxy in each class but the statistic is 
only distributed as in the limit of large numbers so it 
is unwise to take those results at face value. To avoid the 
problem of poorly sampled cells requires some care. If we 
simply exclude cells with fewer than galaxies in any 
class we would end up biasing our results. This is because 
we would be excluding cells with a higher over density in the 
class which has the lower mean density, and thus concluding 
that the other population had more under-dense regions. It 
is thus clear that any thresholding has to be imposed on 
Sp/p rather than n. We thus impose the limit 



> 



n min(ntj) 



(16) 



where we have replaced Sp/p by the estimator Ui-^/n. Now 
our limit, niim, determines the smallest number of galaxies 
of the most sparse class that we would expect in any cell. 
The actual Sp/p limit varies from shell to shell, allowing 
us to extract the maximum information from the surveys. 
These sparse cells still contain useful information, especially 
about the voids. Fortunately our null hypothesis allows us 
to group cells together. We choose to group sparse cells with 
their neighbour but one, rather than excluding them totally. 
We present results for num ~ 1, 2, 3, 4, 5. 



3.3.4 Results 

The results of our contingency table analysis are presented 
in Table |^. The probability P(X^,u) is that of obtaining 
a value of X^ higher than that observed assuming our null 
hypothesis and that X^ is distributed as with v degrees 
of freedom. One can see from this Table that the hypothesis 
that the optical galaxies and the IRAS galaxies are identi- 
cally tracing the same underlying field is ruled out on scales 
5 - 20^"^Mpc with a high level of significance (> 99%). On 
the 30/i~^Mpc with niim = 5 it is impossible to rule out the 
null hypothesis, although the high at smaller may 
hint that our statistics are too poor for this to be informa- 
tive. 

As stated before, we expect some discrepancy because 
of the over-representation of elliptical galaxies in optical 
surveys. In Table ^ we present the results after having ex- 
cluded the CfA ellipticals. Here the story is markedly differ- 
ent. We are unable to exclude our null hypothesis for scales 
5, 10, 20h~^ Mpc. At 30h~^ Mpc we can formally rule out 
our null hypothesis at the 95 per cent level. With only two 
degrees of freedom this result should be treated with cau- 
tion. 
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Table 4. Contingency table analysis results. Under the null hy- 
pothesis that the optically selected galaxies trace the same field as 
the IRAS galaxies P{X, u) gives the probability of obtaining 
higher than the observed value, assuming that X^ is distributed 
as (the statistic X^ is defined in equation ^). is the 

threshold used in merging cells with few galaxies (equation p^ ). 



We thus conclude that the detectable differences in clus- 
tering between these two surveys are mainly due to the 
higher proportion of ellipticals in the optically selected sam- 
ples. It is nevertheless surprising that significant differences 
persist to large scales where we might have expected the 
overabundance of ellipticals in rich clusters to be of less im- 
portance. This could be because of the Coma 'finger of god' 
which is quite extensive in redshift space. Alternatively it 
might suggest that the morphology-density relation extends 
to large scales. 



3.4 Relative Bias Ratio 



The contingency table analysis described in Section 3.3.1 
tests only a very particular linear-biasing scenario (where 
bo = bi). A more general model allows for a relative bias 
between the different samples (equation ^). An extension 
of the above method allows us to test this more general 
model. To allow comparison with estimates of bo/bi from 
other methods (see e.g. Section ^), we will consider a model 
where the optical over-density is a constant multiple {bo /bi) 
of the IRAS over-density. 

We thus construct a series of null hypotheses that the 
two galaxy distributions are linearly biased with a given ra- 
tio of bias parameters. Since we have no direct measures of 
the underlying mass distribution we can only investigate the 
ratio of bo/bi. For clarity we will discuss the biasing of the 
two populations relative to an underlying mass distribution 
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ment with our contingency table analysis if bo/bj — 1): 
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Table 5. Contingency table analysis results. As for Table ^except 
that the CfA ellipticals were excluded prior to analysis. 



on the understanding that our results are only sensitive to 
bo/bj. We could test any given ratio, perhaps arising from 
a physical biasing theory, or a ratio observed from global 
statistical measures such as the b/Q^'^ results from velocity 
fields. Alternatively we could run through a series of possible 
b ratios and see which are the most acceptable. 

To incorporate this biasing ratio we must modify our 
statistic slightly, breaking away from a simple con- 
tingency table analysis. For each cell n^j is estimated as 
Vi ^ . Tiij/ ^ . Vi and An^j = Uij ~ fiij {Vi is the unmasked 
volume of cell i), then 



Oj i^ij Pra 



(17) 



We can thus estimate the underlying mass fluctuations in a 
cell by 



A. 



UijAuij /bjUij 



(18) 



We choose to use the mean densities of the two species 
as weights {wij — Uij). This weighting scheme means our 
analysis will agree with the contingency table analysis at 
bo/bi = 1. Other schemes are possible but they rely on the 
observed values in each cell and are not as robust. 

We are then in a position to construct a new statistic 



var( 



1 Awij Ai + 1 /bj 



(20) 



In the previous analysis we merged cells with an esti- 
mated under-density low enough that, for the class of galax- 
ies with the lowest mean density at that shell, we would ex- 
pect fewer than galaxies in that cell. With our new ap- 
proach a similar argument introduces the following threshold 

A. > max, (1 ^'' ) (21) 
bj riij 

The results presented here have taken num = 5. The Ai limit 
varies with bo/bi, as does Ai itself so the number of cells 
merged depends on the value of bo /bj being tested. 

We calculate the statistic X^ for a range of 0.5 < 
bo/bi < 3 and, under the assumption that this statistic 
is distributed as with M — N degrees of freedom, we 
calculate the probability P(X'^ ,v) as before. 

The results are presented in Figure ^ which shows the 
probability of obtaining a higher value of X^ than that ob- 
served as a function of the assumed biasing ratio bo/bi. 
The curves in Figure ^ are for the four scales 5, 10, 20 and 
30/i"^ Mpc. The curves are not smooth because our cell 
merging threshold depends on the assumed bias ratio and 
jumps occur when new cells become merged. On scales of 5 
to lO/i"^ Mpc all linear-biasing models relating optical and 
IRAS galaxies can be ruled out with around 99 per cent 
signiflcance. At the 5^~^ Mpc scale this is perhaps unsur- 
prising since we are in the non-linear regime {Sp/p > 1) 
so a naive justification for a simple linear-biasing breaks 
down. Also on these smaller scales we expect that the 'fin- 
gers of god' (more prominent for the ellipticals in clusters) 
will have a non-linear impact on our analysis over and above 
any discrepancies in the over-densities. However, on scales 
of 10^~^ Mpc we might have expected that a linear-biasing 
model would have been reasonable. On larger scales (20 and 
30/i"^ Mpc) we do find that there are linear-biasing models 
that we cannot reject. A bias ratio of 1.7 < bo/bi < 3.3 
cannot be rejected with more than 95 per cent significance 
for 20/i~^ Mpc cells and likewise 1.1 < bo/bi < 2.0 is ac- 
ceptable at 30^~^ Mpc. It is interesting that we can reject 
all bo/bi < 1.7 with more than 95 per cent confidence since 
it is within this region that most estimates of bo jbi lie (Sec- 
tion^. Taken at face value this would imply that optical and 
IRAS density fields are not linearly related even on scales of 
20/i"^ Mpc. 

Since we were unable to reject the hypothesis that 
bo/bi = 1 after the exclusion of optical ellipticals, we ex- 
pect a wider range of acceptable bias ratios when comparing 
optical-spirals with the IRAS sample. This is indeed the case 
and for brevity we do not show the P{X^, v) plots for these 
samples. Suffice to say we cannot rule out 0.8 < bojbi < 3 
between 5 and 20/i~^Mpc (all linear-biasing models could 
be ruled out with 95 per cent confidence at 30/i~^ Mpc but 
as before the few degrees of freedom at this scale cause us 
to be sceptical of this results) . 



^2 ^ Y.^,^^'n,,■,|b,n,^ 



var(Anij /bjUij) 



(19) 



The variance is determined from Poisson statistics, neglect- 
ing errors and covariance terms arising from fiij (in agree- 



4 CONCLUSION 

We have completed two new large, deep IRAS redshift sur- 
veys. The large scale clustering properties of these surveys 
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have been analysed and found to be consistent with previous 
IRAS redshift surveys, strengthening previous conclusions 
that the IRAS galaxy distribution is inconsistent with the 
standard CDM predictions. 

We noted two particular shortcomings in the counts-in- 
cells analysis. Firstly, the variances determined using differ- 
ent grid patterns on the same data show a scatter compara- 
ble with the quoted errors. Secondly the method is not com- 
pletely independent of the selection function and we demon- 
strate one way of overcoming this bias. 

We are able to combine all the IRAS samples together 
to obtain a single estimate of the variance on large scales 
with errors that will not be matched until th e completion 



of the fu ll PSC-z survey (Saunders et 1995a; Saunders ei 



a/. 1995b). Converting these variances into the form of a di- 
mensionless power-spectrum (correcting for redshift-space 
distortions), we compare them with a mixed dark matter 
(MDM) model. The slope of this model spectrum appears 
slightly steeper than that observed but the assumptions in- 
volved in transforming theory to data are sufficiently uncer- 
tain that we can not read too much into this. 

A comparison of our variance estimates with those 
from the APM-Stromlo redshift survey suggests bo/bj — 
1.20±0.05, assuming a linear-biasing model with ratio inde- 
pendent of scale. This is hig her than that found by the APM- 



Stromlo team themselves (Loveday et a/. 1992) because we 



Figure 8. P{x^,>^) v bo/bj for IRAS and optical galaxies on 
various scales. 



find smaller IRAS variances than E90. 

Using our FSS-z sample and the first CfA strip we di- 
rectly compare the distributions of optical and IRAS galax- 
ies (using a physically meaningful definition of the two 
classes). This is done on a point-by-point basis so we are 
able to test the hypothesis that the two populations are re- 
lated by a linear-bias ratio {bo/bj). 

We are able to reject the hypothesis that bo/bj = 1 
with a very high degree of significance on the scales 5, 10 and 
20 Mpc. This hypothesis is, however, acceptable if we 
exclude ellipticals and SO galaxies from the optical sample, 
suggesting that the discrepancies are due to the morphology 
density relation and the under-representation of ellipticals 
in IRAS samples. On scales of 30 Mpc our conclusions 
would be reversed but we suspect that our statistics are too 
poor at this scale for this to be very meaningful. 

Allowing bo/bi to vary away from 1 we find that only on 
the largest scales are any optical/IRAS linear-biasing mod- 
els acceptable, if elliptical galaxies are included in the optical 
samples. Even on scales of 20 Mpc, linear-biasing with 
bo/bi < 1.7 is ruled out with more than 95 per cent confi- 
dence. It should of course be stressed that in this particular 
volume the optical galaxy distribution is dominated by the 
Coma cluster and so the distorting effect of the elliptical 
galaxies has to some extent been amplified. 

This non-linear clustering relation between ellipticals 
and IRAS galaxies may present complications for the inter- 
pretation of any galaxy surveys that include both elliptical 
and spiral galaxies. As yet we have been unable to determine 
any differences in the clustering properties of IRAS galax- 
ies and spiral galaxies and so surveys composed of either of 
these can be meaningfully compared. 
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